Statistical estimation with model selection Lucien
نویسنده
چکیده
The purpose of this paper is to explain the interest and importance of (approximate) models and model selection in Statistics. Starting from the very elementary example of histograms we present a general notion of finite dimensional model for statistical estimation and we explain what type of risk bounds can be expected from the use of one such model. We then give the performance of suitable model selection procedures from a family of such models. We illustrate our point of view by two main examples: the choice of a partition for designing a histogram from an n-sample and the problem of variable selection in the context of Gaussian regression.
منابع مشابه
Statistical estimation with model selection
The purpose of this paper is to explain the interest and importance of (approximate) models and model selection in Statistics. Starting from the very elementary example of histograms we present a general notion of finite dimensional model for statistical estimation and we explain what type of risk bounds can be expected from the use of one such model. We then give the performance of suitable mo...
متن کاملModel selection for density estimation with L2-loss
We consider here estimation of an unknown probability density s belonging to L2(μ) where μ is a probability measure. We have at hand n i.i.d. observations with density s and use the squared L2-norm as our loss function. The purpose of this paper is to provide an abstract but completely general method for estimating s by model selection, allowing to handle arbitrary families of finite-dimensiona...
متن کاملSe p 20 06 Model selection for Poisson processes Lucien
Our purpose in this paper is to apply the general methodology for model selection based on T-estimators developed in Birgé (2006a) to the particular situation of the estimation of the unknown mean measure of a Poisson process. We introduce a Hellinger type distance between finite positive measures to serve as our loss function and we build suitable tests between balls (with respect to this dist...
متن کاملModel selection for Poisson processes
Our purpose in this paper is to apply the general methodology for model selection based on T-estimators developed in Birgé (2006a) to the particular situation of the estimation of the unknown mean measure of a Poisson process. We introduce a Hellinger type distance between finite positive measures to serve as our loss function and we build suitable tests between balls (with respect to this dist...
متن کاملAn all-or-nothing phenomenon for superefficiency
In his 1953 paper Lucien Le Cam proved for regular univariate statistical models that sets of points of superefficiency have Lebesgue measure zero (in fact, these sets are even countable). Considering only computable estimators, it is possible to show that no computable parameter point can be a point of superefficiency. This strengthens Le Cam’s result to a dichotomy: either a parameter point θ...
متن کامل